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Network Models for Power Grids: A 
Generative Approach 

Deepjyoti Deka and Sriram Vishwanath 

Abstract 

The inherent structure of the electrical network has a significant impact on its functioning and health. 
As power grids move towards becoming 'smarter' with increased demand response and decentralized 
control, the topological aspects of the grid have become even more important. Understanding the 
topology can thus lead to better strategies to control the smart grid as well as enable rapid identification 
and prevention of risks such as blackouts. This paper focuses on modeling and analyzing basic simi- 
larities in network structure of large power grids across America and Europe. It studies the topological 
characteristics of power grids and develops a generative model for them. Based on the typical small 
world characteristics, large scale test power systems can be developed to better study the working of 
new algorithms for smart grids. 

I. Introduction 

The modern-day electric/power grid is a large complex interactive system that affects the 
every-day activities of millions of people; serving all the other sectors of the economy. The 
emergence of a 'smart' grid has added further complexity to the system by bringing in new ideas 
of demand response, real-time pricing and flexible demand leading to increased control in the 
hands of end consumers. This, in turn, can lead to reduced pricing and more careful supervision 
of the grid to preempt/detect failures like cascading blackouts and intentional attacks. Studies 
on the performance of new technologies for implementation on the grid focus primarily on the 
test bus systems of different sizes, such as the IEEE 14-bus, IEEE 30-bus, IEEE 57-bus and 
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IEEE 118-bus systems. These bus systems and their data can be found at lfT4ll . However, for an 
improved understanding of the impact of new technologies for smart grids, test cases should be 
of the same scale as the large network of buses in the grid and should follow the general trends 
of the topology present in such massive networks. 

Existing analysis has revealed the connection between the working and robustness of the grid 
to its topological characteristics. Albert et al. 0] and others have established the importance of 
understanding the physical network that describes the topological structure of the power grid (as 
represented by the adjacency matrix and/or the network's degree distribution) in determining the 
vulnerability of the grid to random or directed attacks. Similar work can also be found for other 
networks, including social networks fl4), the Internet (3]| and population models for spread of 
diseases flU and effective vaccination [ITOll . In this context, Watts and Strogatz [fTTTl first modeled 
the power grid's physical-network statistically as having 'small-world' behavior and measured 
features of the connectivity graph, such as characteristic path length, clustering and average 
degree. Despite the fact that the power grids possess obvious 'small world' characteristics, it 
emerges that there is some variation within them compared to other types of networks (such as 
online social networks). 

In particular, there have been several papers on the degree distribution of the underlying 
physical network associated with large power grids. Though there has been some work reporting 
power-law/scale free distributions, a considerable body of work reports an exponential degree 
distribution or, in the least, an exponential tail. H2] reports the presence of exponential degree 
distribution while considering power plants, substations, and 115 - 765 kV power lines of the 
North American power grid. The exponential degree distribution is also observed for the western 
power grid as reported in lfT3l . Hines et al. in [Qj] also suggests an exponential degree distribution 
in The Eastern, Western and Texas Interconnects and reject the power-law degree distribution for 
these grids as well as for the 300 bus IEEE test case. A very clear exponential tail distribution 
has also been reported by ifToTl for the NYISO. Similarly, 0, flU have mentioned the 
exponential degree distribution of the power grids of Europe - namely in Italy, UK, Ireland, 
Portugal as well as for the entire connected component of Europe obtained from the Union for 
the Co-ordination of Transmission Electricity, UCTE. Although power grids in different countries 
possess different average degree per node due to their different geographical terrain and shape, an 
exponential degree distribution seems widespread across regions and large grids. Even though 
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the distribution has been reported to be exponential in existing literature, a generative model 
motivated by physical characteristics that sheds light on the structure of power grids in general, 
and its exponential distribution in particular, is yet absent. Such generative models play a critical 
role in enabling the development of an analytical framework for power grids, similar to the role 
played by generative models for social and biological networks. 

Our paper is dedicated to providing a comprehensive and meaningful generative framework for 
understanding the physical network associated with power grids. Our approach differs from others 
in that we look at the physical network evolution as a generative process and present a model that 
aims to explain the exponential degree distribution for the power network. We present a model 
(in the mean-field sense) that attempts to capture the distribution and the intuition underlying 
such a model. 

The main results of this paper are as follows: 

• We use a spatial Poisson point process to develop an understanding of the creation of buses 
for the physical network corresponding to power grids and look at its evolution over time. 
We argue that the resulting degree distribution generated is a natural fit to an exponential 
distribution. 

• We also present a mean-field based analysis for the model and present a mathematical as 
well as intuitive analysis of the model. 

• We verify that the diameter of the resulting network matches that of real-world power grids 
through numerical examples. 

This work connects naturally with two important research issues connected with power grids: 
a. communication over grids; and b. analysis of the vulnerability of a power grid. Once a simple 
but universal generative model for power grids is understood, it can be used as a basis to 
understand mechanisms for effective communication over grids, as well as to uncover the main 
sources of vulnerability within the grid. 

The diameter of a graph is a measure of the longest route between any two nodes in the graph 
and its scaling with respect to the number of nodes in the network thus represents the time needed 
for the flow of information over the network. The presence or absence of small-world behavior 
in a power network indicates the existence of hubs (nodes/buses with high degree) which directly 
influences the probability of cascading failures in the network given directed/random attacks on 
the network. Characterizing the diameter and small-world behavior of graphs can play a key role 
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in determining mechanisms that render it resilient to passive failures or active attacks. 

The rest of this paper is organized as follows. The next section presents a description of the 
generative model for the degree distribution of large power networks. The degree distribution 



is analyzed and numerical results are provided in Section III The diameter calculations of our 
model for different sizes of the power network are discussed in Section IV Results, effects and 
future topics of study are presented in Section |Vj 

II. Generative Model for exponential degree distribution 

As discussed in the introduction, large power grids are often found to possess an exponential 
degree distribution. Now, we present a generative model based on the framework of 2-D spatial 
Poisson point process theory. We assume that nodes/buses are randomly placed in space according 
to a Poisson point process P\ with density A. However, unlike a majority of existing literature 
on random geometric graphs where all nodes are assumed to be already present, our goal is to 
obtain a generative model that reflects the key characteristics of the power grid. In other words, 
we assume that edges/links between nodes are formed as and when new nodes enter the system 
(join the power grid). In our model, every new node in the system connects to K of its nearest 
neighbors (preexisting nodes in the network) where K is a random variable following a given 
distribution. We first consider K to be a constant (k) and show that the analysis extends to 
cases where K is a random variable. Note that K affects the average degree of the network. 
Thus, unlike a random geometric model, longer edges in our generative model are more likely 
to occur in the early phases due to the restriction of edge formation with only existing nodes 
in the graph which can be born quite far apart in the geographical area. The nodes numbering 
1 to iV (the size of the network) are generated according to the aforementioned Poisson point 
process. This can be justified based on the fact that real power networks have evolved over time 
and increased in size to their current state. In other words, not all buses are formed at the same 
time. Thus, links are formed when new nodes/buses are established over a given geographical 
area (the creation of new nodes can be modeled by a random process) and they connect new 
nodes to the existing buses in the network. The final state of the network may have 2 nodes 
A and B quite close together but there may be a link from A to another node C instead of B 
despite A being further away from C than from B. This can happen if A and C were formed 
quite early and within a small time interval of each other and B was formed much later in the 
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generation process. 

The formation of K connections for each new node in the network can be seen as inducing a 
measure of reliability within the grid. Meanwhile, the selection of 'nearest neighbors' instead of 
'random nodes' can be explained in terms of minimizing the cost of interconnects involved in 
these connections. Thus, in our generative model, each new node forms K -redundant minimum 
cost connections into the existing network. Note that this is one of the more natural explanations 
for the organic evolution of power grids across the globe. Next, we find that such a simple 
explanation is indeed reasonable, as it leads to an appropriate exponential distribution model for 
the resulting physical network. 

III. Mean field analysis of the exponential degree distribution 

The generative model for the power network assumes creation/ birth of a new node at every 
time step and formation of K new links from the new node to its K closest neighbors. Thus after 
iV time steps, we have a iV bus system. We begin with a given circular area R of known radius 
r and generate points according to a Poisson point process of density A. If K were a constant, 
then we form a complete network with the first K + l nodes. At every time- step t > K+l, the 
probability of a new node joining a previously present node A present in the area depends on 
the region of influence of the node A where 'region of influence' refers to the area Rai within 
R such that for every point p within Rau node A is amongst the K nearest neighbors of p at 
time t. We also assume the absence of two or more links between the same two nodes. Thus, the 
evolution of the model and the degree distribution follow from the evolution of a higher order K 
Voronoi-region defined by the Poisson point process of density A over R. In particular if K is a 
constant at 1, the process reduces to formation of links to the nearest neighbor and thus follows 
the basic Voronoi-region evolution of a 2-D Poisson point process. Such a Voronoi-region for 
a 20 point system is shown in Figure [T] The dots represent the nodes which already exist and 
probability of formation of a edge to a particular node is directly proportional to the area in its 
Voronoi-region. 

Such processes are non-trivial to analyze and closed form results are not available for higher 
order Voronoi-regions. Therefore, we begin by using a mean-field framework for analyzing the 
degree distribution. The main principle behind a mean-field formulation is to replace the dynamics 
of a single point, bus or node with the average dynamics of the entire system at every time step. 
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Fig. 1. Voronoi-region formation for a 20 point system with single link formation per node 



A good reference for mean-field techniques for analysis in the domain of network sciences can 
be found in [18]. By using a continuous time approximation of the system, for every time step 
t, using a mean-field framework, we can view the dynamics of an existing node i as follows: 
Let di(t) be the degree of node i at time t. 

Let v it , P(vit) be the volume of region of influence of i and its probability. Let K be a constant 
value at k. 

d{di(t)} _ ^ ^ among the fc-nearest neighbors of node t at time t) 

at 

cc E(v it ) = E{v t ) (1) 

At every time t, the expected value of the volume of the k th order Voronoi-region for t nodes 
depends only on their positions (which are random) and not on the order in which the nodes 



7 



entered the network. Since at every time T , k links are formed, 



E 



d{di(t)} 



= k 



dt 




=>- di(t) - k = k\og{t/i) 



di(t) = k(l + log(f/i)) 



(2) 



Thus, di{i) decreases with i. So to get the CDF of the degree distribution d(t), we use: 

for some degree d, di(t) < d 

k(l + log(*/i)) < d 
\og(t/i) <d/k-l 



Thus, we get an exponential degree distribution with density A = k. For a different density 
including non-integral values, we can assume that K is no longer constant but a random variable, 
such that with probability a, k\ neighbors are induced; and with probability 1 — a, k 2 neighbors 
are induced in the graph. For such cases, the density A is aki + (1 — a)k 2 . 

Exact Analysis and Proof of Validity: The previous mean field-analysis depends on a 
continuous-time approximation of the generative process. We can look at the process in discrete 
steps as well. Let N(m, t) represent the average number of nodes of degree m at time t with 
K = k (constant) being the number of new links formed at each step. Now at time t + 1, the 
average number of nodes of each degree m increases if the new node coming in at time t + 1 
forms links with nodes of degree m — 1; and decreases if links are formed with existing nodes of 
degree m. The probability of the average number of nodes of a particular degree m increasing 
at a time step thus depends on the number of nodes of degree m — 1 at the previous time step. 
Mathematically, 



=^ i >te k 



P(d(t) < d) 



(t-i)/t 



P(d(t) < d) 



d-k 

1 - e k 



(3) 
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Ar/ . N(m-l,t)k N(m,t)k 
N(m,t + l) - N(m,t) = — — — — — h 5(m = k) (4) 



For m = k , we have 



N(k,t+ 1) - N(k,t) = + i (5) 

t 



For large t, using Lemma 4.1.1. in |fT9ll , this becomes 

N(k,t) 1 
t ~~ 1 + k 

For m 7^ k , 



(6) 



k. N(m — 1, t)/c 
N(m,t + l) = N(m,t)(l--) + — ± ' ; (7) 



For large t, again using Lemma 4.1.2 in |fT9ll , we obtain 

iV(m, t) _ lim^oo N( s m ~ 1 ' t ) k 

t ~ 1 + k 

N(m,t) _ k (m _ fc) 1 



(8) 



t 1 + 1 + Jfe 

So , average fraction of nodes n(d) with degree less than d can be obtained from 

=► n(d) = 1 - (j^)^ 

n ( d ) = 1 - (i±V" d ) 
A; 

=► ^) = l-((l + ^) fe )-^ 

n{d) = 1 - (9) 

The validity of using a mean-field model of average degree distribution to represent an actual 
degree distribution can be established using the Azuma-Hoeffding's inequality [fT9l . Here, we 
take the number of nodes of a given degree m at time t, i.e, Z(m,t) as a random variable and 
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the location of the incoming nodes yi, y 2 etc. as the variables influencing the random variable 
Z(m,t). We obtain a martingale Zi(m,t) by 

Zi(m,t) = E(Z(m,t)\y u y 2 ...,yi) (10) 

where N(m,t) = Z (k,t) and Z(m,t) = Z t (m,t). 

We have | Z^ (m, t) — Zj_i(m, t) | < 2A; as number of nodes of a particular degree cannot change 
by more than 2k at each time step. Thus, using the Azuma-Hoeffding inequality, we have 

x 2 

P(Z(m, t) - N(m, t)>x)< => P(Z(m, t) - N(m, t) > x) ->■ as t 

This proves the validity of the mean field analysis. 

A. Exponential Fit of the Generation Model 

We show the validity of the mean value analysis for the generation model by fitting the 
degree distribution achieved from the generation model for different densities with exponential 
distributions as shown in Figures [2} [3] and |4j We consider Poisson point process over the specified 
area and simulate positions randomly several times (500 iterations each) and then average our 
findings to plot the degree distribution. In Figure [2[ we show the degree distribution for 2 different 
values of K = k and A respectively. It can be seen that changing A increases only the number 
of nodes formed and doesn't change the slope of the graph which depends on the fixed value of 
k alone. In Figure |3j we again keep K fixed and fit exponentials to match the plot of the p.d.f.s. 
The variable K case is shown in Figure [4] where K takes values from a distribution over 3 
values (3, 4 and 5). Here, we illustrate the exponential nature of the p.d.f.s by fitting exponential 
distributions to them. 
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Fig. 2. Degree distribution for Poisson generated nodes for fixed K in radius 20 
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Fig. 3. Fitting exponential p.d.f. to the degree distribution for fixed K in radius 20 
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Fig. 4. Fitting exponential p.d.f. to the degree distribution for variable K in radius 20 



Though exponential fitting of the p.d.f.s of the degree distribution of the generative model 
accurately models the first order behavior, we still see the tails of the degree distribution decaying 
faster than at an exponential rate. To better represent this in a fitting model, we use p.d.f.s from 
the exponential family of random variables to fit the data next. 

B. Exponential Family Fit of the Generation Model 

Exponential families cover a large group of probability distributions which encompass com- 
monly used ones like Gaussian, exponential, binomial, gamma, Weibull, Poisson distributions. 
They have the unique property of having the maximum entropy given constraints on expected 
values (mean, variance etc.). Working with the degree distribution of the nodes in the power grid 
graph, we need the average degree of the fitting p.d.f. to match the empirical average degree. We 
begin with a combination of exponential and gamma distributions to fit the empirical p.d.f. We use 
the software package CVX [17] to optimize for the combination of the considered distributions 
to get the best fit. We first threshold the maximum degree of a node to one less than the value 
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for which the complimentary cumulative distribution function (CCDF) of the degree distribution 
falls below 1CT 10 . Higher degrees are thus ignored as they contribute negligibly to the CCDF 
and the corresponding p.d.f. We then fit the p.d.f. up to this maximum degree using CVX by 
minimizing the L-2 norm of the difference between the empirical and the fitted p.d.f. In Figures 
[5]and[6j we show the p.d.f. obtained for fixed and random values of K (number of edges formed 
by an incoming node) respectively. The performance of the fits are better seen in the linear scale 
as shown in Figures [7] and [8] 
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Fig. 5. Fitting a combination of exponential and gamma p.d.f.s to the degree distribution in radius 20 for fixed K 
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Fig. 6. Fitting a combination of exponential and gamma p.d.f.s to the degree distribution in radius 20 for variable K 



0.25 r 



0.2 



X = 1.8, k = 3 
fit 



0.15 



T3 



0.1 



* 
1 

1 



t 
« 



0.05 



X of Poisson distribution = 1 .8 
k = number of links per new 
node 



10 20 30 

degree of nodes 



40 



50 



Fig. 7. Fitting a combination of exponential and gamma p.d.f.s to the degree distribution in radius 20 for fixed K 
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Fig. 8. Fitting a combination of exponential and gamma p.d.f.s to the degree distribution in radius 20 for variable K 



We proceed to use another set of fits using the Weibull distribution together with the exponen- 
tial and gamma distributions as before. Figures [9] and 10 show the improvement in performance 
by the addition of Weibull distribution to the fitting process. On an average for different cases 
(fixed and variable K), the addition of the Weibull distribution leads to a decrease in the L-2 norm 
of the fitting error from to 1CT 4 while using the same number of combining distributions 
to model the empirical p.d.f. (around 10). In Figure [9j we show the performance for a fixed 
value of K and in Figure ITOl the p.d.f. is fit for a variable K. 
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Fig. 9. Fitting a combination of exponential, gamma and Weibull p.d.f.s to the degree distribution in radius 20for fixed K 
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Fig. 10. Fitting a combination of exponential, gamma and Weibull p.d.f.s to the degree distribution in radius 20 for variable K 
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Fig. 11. Diameter of the network v/s size of the power grid network 



IV. Diameter of our generative model 

In general, due to different topological constraints, power grids vary in their connectivity 
parameters and average degree. Thus, the scaling of the diameter of the network graph with the 
number of nodes is hard to quantify. Using the data given in [fT51l , Ifl6l , we compare power grids 
with similar average nodal degree and observe that the average diameter of the network scales as 



the logarithm of the network size (number of nodes). Figure 11 illustrates this finding. We carry 
out diameter analysis of our proposed generation model to find out small world characteristics 
as mentioned earlier. Simulations are run for different values of N (number of nodes) and K 
(number of connections per incoming node). The results show a similar scaling between the 
diameter of the network and the size of the network as found in real networks. This is in 
accordance with the approximate analysis of diameter for graphs in [1T81 as shown below. This 
shows that for an exponential degree distribution, the diameter should scale as the logarithm of 
the number of nodes in the graph (small world). 



17 



A. Approximate analysis of diameter of a graph 

Consider a graph with a given degree distribution with p.d.f. P(d) and average degree given 
by d av . Now the diameter is the maximum number of hops needed to get from any node to 
any other node in the graph. Assume that the degrees of nodes sharing an edge are independent 
of each other. Choose a random node A. Then, the degree distribution of a randomly chosen 
neighbor of A is given by P ^f >d . Thus, the total number of nodes at a distance of two hops from 
A through the chosen neighbor is given by ^2 d (d — 1) p ^ d or <d2 >~ d ^ and the total number 
of two hop neighbors of A is given by d av <d2 >~ d ^ , Expanding the search for nodes at greater 
distance from A by considering the ones at a distance of two hops and so on, we find the 
estimate of the diameter as the number I such that 

em \ r ) = n ~ i (id 

^ x = jQgECjV; - !)(< d 2 > -2d av ) + dU - logKJ (12) 
log[< d 2 > -d av ] - log[d av ] 

Here, d av = 2k and < d 2 >= 5k 2 where K = k is the minimum number of links formed by 

each node and is obtained based on an exponential C.D.F. obtained through mean field analysis. 

This gives 

log[(JV - l)(5fc 2 - 4k) + 4k 2 } - log[4A; 2 ] 



I 



log[5k 2 - 2k] - log[2fc] 



This is verified using Figure 12 for a network developed using our generation model. This 
highlights the dependence of the diameter on N. We also find a similar dependence on iV for 



the average path lengths between nodes of the simulated network as shown in Figure 13 



To showcase the relationship between K, the minimum number of links per node of the system 
and the diameter of the network, we present simulation results for our model and observe the 



decaying behaviour behavior depicted in Figure 14 This is due to the fact that as K increases, 
more edges/links are formed at each time- slot leading to a decrease in diameter. 




Fig. 13. Average path length of the network v/s size of the network TV 
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V. Results and Conclusion 

In this paper, we provide a generative model for the network representing power grids. The 
model generates a graph representation for the power grid where new nodes are assumed to appear 
randomly at different locations within a given geographical area and form links with K nearest 
neighbors. We demonstrate the efficacy of the model by analyzing its degree distribution and 
diameter. The degree distribution of the generative model has a good fit with the exponential 
degree distribution observed for power grids and reported in literature. We further provide a 
mean-field analysis of the generative process and show that it gives rise to an exponential degree 
distribution with the same average degree as observed in simulations of the model. The diameter 
of the network has implications for the spread of information and vulnerability of the power grid. 
We measure the diameter of the network given by the generative model and show a logarithmic 
scaling with the number of nodes. We verify it by comparing it with the approximate analysis 
of network diameter given in lfT8l . The scaling is similar to observations in real power grids 
with comparable average degree of the network. Other topological features of the the power grid 
include features like 'betweeness', clustering coefficient etc. which need to be studied to better 
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understand the practical viability of our model. This is the focus of our future research in this 
domain. 
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